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THE WARWICK AUTOMATIC GROUPS SOFTWARE 


Derek F. Holt 


Abstract. This paper provides a description of the algorithms employed by the 
Warwick Automata package for calculating the finite state automata associated 
with a short-lex automatic group. The aim is to provide an overview of the whole 
process, rather than concentrating on technical details, which have been already been 
published elsewhere. A number of related programs are also described. 


1. Introduction 

The purpose of this paper is to provide a description of the algorithms employed 
by the Warwick Automata package for calculating the finite state automata as¬ 
sociated with a short-lex automatic group. Such a description, with full technical 
details and proofs, has already been published in [2]. Our aim here, however, is to 
attempt to give a less technical overview of the full procedure. This overview also 
serves to present a more accurate and up-to-date description of the procedure than 
[2]. Some recent examples on which Automata has been successfully run will then 
be discussed. Finally, we shall describe some associated programs, which perform 
various functions on the output. All of these programs are available from Warwick 
by anonymous ftp. (Please e-mail the author for details.) 

It will be assumed that the reader has some familiarity with the definition and 
elementary theory of finite state automata. The best general reference for the theory 
of automatic groups is the six-author book [1], If A is a finite set (the alphabet), 
then A* will denote the set of all strings of elements of A, including the empty 
string, which will be denoted by e. Elements of A* will be referred to as words in 
A. In this paper, A will always denote a finite set which generates a group G as 
a monoid, so each word in A can be regarded as an element of G. If u and v are 
words in A, then we shall use u = v to mean that u and v are equal as elements of 
G, whereas u = v will mean that u and v are equal as strings. 

Let A and G be as above. Then G is said to be automatic (with respect to A), if 
it has an automatic structure. This consists of a collection of finite state automata. 
The first of these, usually denoted by W, is called the word-acceptor. It has input 
alphabet A, and accepts at least one word in A mapping onto each g £ G. The 
remaining automata M x , are called the multipliers. There is one of these for each 
generator x £ A, and also one for x = Ida (which will be denoted by M$). They 
read pairs of words (uq, W 2 ), with uq, u >2 £ A*, and accept such a pair if and only if 
uq and W 2 are both accepted by W, and W\x = u> 2 - (We shall clarify the technical 
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problem of dealing with words w\ and u >2 of different lengths at the beginning of 
the next section.) 

Notice that this definition apparently depends on the particular monoid generat¬ 
ing set A for the group G, as well as G itself, but it is not too difficult to prove (see 
Section 2.4 of [1]) that the property of being automatic is preserved under change of 
generators, and so we can unambiguously say that a particular group is automatic 
or not automatic. Although it is not necessary theoretically, the programs we are 
discussing require that A is closed under inversion (that is, for each x £ A, there 
exists y £ A with xy = Ido), and so we shall assume this from now on. 

The definition says nothing specific about the nature of the words accepted by the 
word-acceptor. We shall be dealing here exclusively with so-called short-lex word- 
acceptors, which we shall now describe. For this, we require an ordered monoid¬ 
generating set A for the group G. For words u and v in A, we write u -< v if either u 
is shorter than v. or u and v have the same lengths, but u comes earlier than v in the 
lexicographical ordering of A* relative to the given ordering of A. So, for example, 
if A = {a, h} with a < b, then £ -< a -< b -< aa -< ab -< ba -< bb -< aaa -< aab. This is 
known as the short-lex ordering on A*. A word w in A is said to be irreducible , if 
it is the least word in the short-lex ordering which is equal to w as an element of 
G. Otherwise, it is called reducible. 

The group G is called short-lex autoviatic with respect to the order monoid¬ 
generating set A, if it has a short-lex automatic structure. This is an automatic 
structure in which W accepts precisely the irreducible elements in A*. In particular, 
it accepts a unique word for each group element, and accepts the empty string as 
representative of Ida- Note that the equality multiplier M$ accepts (u,v) if and 
only if u = v and u is accepted by W. 

It turns out that the existence of a short-lex automatic structure for G definitely 
does depend on both the choice and the ordering of A. Some important classes of 
groups, including word-hyperbolic and finitely generated abelian groups have short- 
lex automatic structures for any choice of ordered generating set. Other groups, 
including many Euclidean groups (finitely generated free abclian-by-finite) have 
short-lex automatic structures for some choices but not others. Finally, there are 
some groups, such as the braid groups B. n for n > 4, which are automatic, but 
probably have no short-lex automatic structures at all. 

The Warwick Automata package can (currently) only handle short-lex auto¬ 
matic structures. It cannot provide any information at all on whether a group 
might be automatic if it has no such structure. However, if the group does have 
such a structure using a particular ordered generating set A then, in principle, given 
enough time and space, the programs would be capable of verifying this fact, and 
calculating the associated finite state automata explicitly. I am not aware of any 
existing software which can do the same thing for automatic structures which are 
not short-lex. In Chapter 6 of [1], an algorithm is described for deciding whether 
a given collection of finite state automata consists of the automatic structure for 
some (not necessarily short-lex) automatic group G. This algorithm has two short¬ 
comings. Firstly, it does not necessarily determine the group G up to isomorphism 
(although this would probably be fairly clear in most cases). Secondly, and more 
seriously, it does not make any suggestions as to how to come up with the automata 
for testing. 

The Automata package for short-lex automatic groups takes a finite group 
presentation as input (it is proved in Theorem 2.3.12 of [1] that all automatic 
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groups are finitely presented). The ordering of the generating set is defined by the 
order in which the generators are input, and inverses are inserted directly after 
each generator, unless they are explicitly specified elsewhere. The program first 
attempts to produce likely candidates for the automata in a short-lex automatic 
structure for G. It then uses a method similar to that described in Chapter 6 of 
[1] to verify the correctness of these automata. However, in the short-lex situation, 
the details are considerably simpler than in the general case and, furthermore, it is 
verified that the automata really do correspond to the given group G. Although it is 
perhaps a somewhat rash claim to make, I believe that the fact that these programs 
effectively verify the correctness of the results that they produce renders the results 
intrinsically more reliable than those produced by many computer programs. 

The remainder of the paper is organized as follows. In Section 2, we discuss 
two-variable finite state automata, logical operations on finite state automata, and 
introduce the concept of word-difference automata, which is central to the main 
algorithm. In Section 3, we describe the algorithm itself. In Section 4, we give 
a few examples with time and space requirements, in order to give an idea of 
the current scope of the procedure. Finally, in Section 5, we discuss briefly some 
related programs that are available with the Warwick package, which take a short- 
lex automatic structure as input. These are really just operations on finite state 
automata, and are not inherently dependent on the group structure. They include 
methods of determining the cardinality of the accepted language, enumerating the 
accepted language, and determining the growth series of the accepted language. By 
applying the last of these to the word-acceptor in a short-lex automatic structure, 
we obtain the growth series of the group. 

Nearly all of the code in the Warwick package was written by the author, David 
Epstein and Sarah Rees. A procedure for minimizing the number of states in a 
finite state automaton accepting a given language was written by Dave Pearson. 
The growth-series code was written by Uri Zwick. 

2. Finite State Automata 

Let X be a finite state automaton with input alphabet A. We denote the set 
of states of X by <S(X) and the set of accept states by A(X). The programs in 
Automata deal exclusively with partial deterministic automata , and all automata 
mentioned in this paper are of this type. (See Definition 1.2.5. of [1].) They have 
a unique initial state <jq(X), and for each x £ A and a £ S(X), there is either 
a unique transition from a with label x, or no transition. In the first case, we 
denote the target of the transition by cr x , and in the second case we say that a x is 
undefined. We can then define a u , for a £ S(X) and u £ A* in the obvious way. 

The automata in an automatic structure of a group G may be one-variable or 
two-variable. The word-acceptor is one-variable, and has input alphabet a monoid 
generating set A of G. The multipliers (and other automata, to be introduced 
later) are two-variable, and the idea is that they should read two words u and v in 
A simultaneously, and at the same rate. This creates a technical problem if u and 
v do not have the same length. To get round this, we introduce an extra symbol $, 
which maps onto the identity element of G, and let A^ = A U {$}. Then if (it, v) is 
an ordered pair of words in A , we adjoin a sequence of $’s to the end of the shorter 
of it and v if necessary, to make them both have the same length. The resulting 
pair will be denoted by (u,vy, and can be regarded as an element of (AT x A')*. 
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Such a word has the property that the symbol $ occurs in at most one of u and v, 
and only at the end of that word, and it is known as a padded word in A x A. We 
shall assume from now on, without further comment, that all of the two-variable 
automata that arise have input language A t x A' and accept only padded words. 

The Automata package makes heavy use of logical operations on finite state 
automata. Let X and Y be two finite state automata, with the same input alphabet, 
and let the accepted languages be L(X) and L(Y), respectively. Then there are 
straightforward algorithms for constructing finite state automata X ■ Y, X ', X AY 
and IVf, with accepted languages L(X)L(Y), A* \ L(X), L(X) fl L(Y) and 
L{X) U L(Y), respectively. If Z is a two-variable automaton accepting padded 
words in A x A then there is one-variable automata E(Z ), with input alphabet A , 
that accepts u £ A* if and only if there exists v £ A* with (u , u) 1 ' £ L(Z). (Clearly 
the same applies with u and v interchanged, and we can construct the corresponding 
“for all” automata by using a combination of negation and “there exists”.) These 
operations are all implemented in Automata. The algorithms are described in 
Chapter 1 of [1], and we shall not go into details here, except to mention that the 
construction of E(Z) amounts to constructing a deterministic automaton equivalent 
to a non-deterministic one, and potentially involves increasing the number n of 
states of Z to 2 n states of E(Z). This is one of the reasons why the programs can 
be very space-demanding for larger examples. Automata also contains a procedure 
for replacing an automaton X with an equivalent one (that is, one accepting the 
same language) with the minimal number of states. This is applied routinely after 
all of the logical operations that we have just described. 

The principal difference in emphasis between the current paper and [2] is that 
we shall describe as much of the algorithm as possible in terms of these logical 
operations on finite state automata. In [2], we gave more detailed direct descriptions 
of the individual procedures, and put much more emphasis on the string-rewriting 
aspects. I hope that the current approach is conceptually easier to understand. 

We shall now introduce the concept of word-difference automata , which play a 
central role in the programs. Assume, as usual, that A is a monoid generating set 
for a group G. Then a two-variable automaton Z accepting padded words in Ax A 
is called a word-difference automaton, if there is a function a : S(Z) —► G such that 

(i) a{a 0 (Z)) = Id G , and 

(ii) for all x, y £ A^ and r £ S(Z) such that rWsd is defined, we have afr^^) = 
x~ 1 a(r)y. 

We shall assume that all states r in a word-difference automaton Z are accessible; 
that is, there exist words u,v in A such that <tq(Z)( u,v ^ = r. It follows from 
Properties (i) and (ii) that a(r) = u -1 u, and so the map a is determined by the 
transition function of Z. We shall also assume that each accept state is mapped by 
a onto the same element g of G. We then say that Z is a word-difference automaton 
accepting g. 

It is proved in Theorem 2.3.2 of [1] and Theorem 2.3 of [2] that any automatic 
structure has the so-called fellow-traveler property. This means that there is a 
finite set WV (the word-differences) of elements of G with the following property. 
If u,v £ L{W) and either u = v or u~ 1 v = x for some x £ A, and p(u) and p{v) 
are prefixes of u and v having the same length, then p(u)~ 1 p(v) £ WV. We also 
insist that x £ WV for all x £ A U {Ida}- We can therefore define associated 
word-difference automata WD X , for x £ A U {Ida}, in which the set of states is 
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WT>, a is the identity map, and t^ x,v ' 1 is defined if and only if x~ 1 ry £ WT>, in 
which case it is equal to x~ 1 ry. The initial state is Ida , and there is a single accept 
state x. 

It is clear from all of this (c.f. Theorem 2.3.4 of [1] and Corollary 2.4 of [2]) 
that the multipliers M x in an automatic structure accept (u, v )' if and only if WD X 
accepts (u, and the word-acceptor W accepts both u and v. As we shall see in 
the next section, this fact can be used to construct them. 

There are two other word-difference automata which arise in the algorithms. We 
define L\ to be the set of padded words (u, in Ax A with the following properties 

(i) u = v in G; 

(ii) v is irreducible; 

(iii) u is reducible; 

(iv) u = tx with t £ A* , x € A, and t is irreducible; 

(v) u = ys with s £ A*, y £ A, and s is irreducible. 

So, if we think of the elements of L\ as reduction rules for G, where we replace 
occurrences of the left hand sides of rules by the corresponding right hand sides, 
then L\ is the minimal set of reduction rules necessary to reduce any word in A to 
its irreducible representative. If G has a short-lex automatic structure with respect 
to A , then it follows from the fellow-traveler property that the automaton WD$ 
accepts all words {u,v)^ in L\. (This is because WD X accepts (t, v)^ with t as in 
(iv).) Since the word-acceptor W accepts precisely the set of irreducible words, it 
is clear that the words (u, vy satisfying properties (ii)-(v) can be recognized by 
some finite state automaton X. Since the words accepted by WD$ satisfy property 
(i), it follows that X AW D$ is a word-difference automaton accepting the language 
L\. More generally, we shall say that a word-difference automaton accepting Ido 
has property V\, if its language contains L\. 

We define L 2 to be the same as L\, except that we omit property (v) from 
the definition; so L 2 is usually larger than L\. Again, there is a word-difference 
automaton accepting L 2 , and a word-difference automaton accepting Ida is said to 
have property V 2 if its language contains £ 2 - In particular, WD$ has property AT 

In the programs, we shall be constructing automata D 1 and D 2 which we hope 
will have properties V\ and V 2 , respectively. (These are referred to as the principal 
transition table and the product transition table in [2].) 

3. Description of the Algorithm 

To understand the algorithm, it is essential to distinguish between the theoretical 
and the practical points of view. The material in Section 2 was completely theo¬ 
retical. We assumed the existence of a short-lex automatic structure for a group 
G, and used the associated automata W and M x to deduce the existence of WD X , 
and word-difference automata with properties Vi and V 2 - 

The algorithms also assume the existence of an automatic structure, but their 
aim is to construct this structure, and W and M x are not explicitly available at the 
beginning. Thus we cannot use them to construct the word-difference automata 
D\ and Z? 2 . On the contrary, we first guess candidates for D\ and D 2 , and then 
use these to construct candidates for W and M x . and finally check the correctness 
of the complete automatic structure. 

Another point to be borne in mind is that we cannot solve the word problem 
in G until the whole process is complete. This means that we cannot necessarily 
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calculate the map a in the word-difference automata precisely, even though we 
know theoretically that it exists. The best we can do is to calculate a map d from 
the set of states to A *, where d(a) is the least element of A* (under the short-lex 
ordering) which is currently known to be equal to a (a) in G. In the construction 
of D\ and £> 2 , we really want the states to be elements of WV and a to be the 
identity map, and so in practice, the states are elements of A* which we cannot 
currently reduce, and a(a) is the identity map. Whenever we discover that two 
states are equal in G, we amalgamate them. 

We can now give a somewhat simplified breakdown of the complete algorithm. 
The input is an ordered monoid-generating set A of a finitely presented group G, 
together with defining relations for G given as words in A. In principal, it will 
succeed if and only if G is slrort-lex automatic with respect to A, in which case 
it will construct the finite state automata of the associated automatic structure. 
(After minimization, these automata are essentially uniquely determined, because 
their languages are uniquely determined by A and G.) 

Step 1. Construct a word-difference automaton D\ accepting the identity, which we 
hope has property V\. 

Step 2. Construct a candidate for the word-acceptor W. Construct the equality 
multiplier M$, which accepts the padded word (it, i>)f if and only if u = v £ 
L(W). 

Step 3. Construct a word-difference automaton D 2 accepting the identity, which we 
hope has property V 2 ■ 

Step 4. Construct candidates for the multipliers M x , for all x € A. If, during this 
construction, we find distinct words u and v in A , both accepted by W, 
with u = v in G, then return to Step 1. 

Step 5. Check correctness of £> 2 - If incorrect, return to Step 3. 

Step 6. Check correctness of W and M x . If incorrect, return to Step 1. 

We shall describe these steps in more detail. All of them except for Step 1 are 
essentially logical operations on finite state automata, and so we leave Step 1 until 
last. Assume then that we have completed Step 1, and let GT be the two-variable 
automaton which, for u,v £ A* accepts (it,u)f if and only ifuyv. In Step 2, we 
construct W using the standard finite state automaton operation 

W = ( A * ■ E(D± A GT) -A*)'. 

Expressed in words, W accepts w £ A* if and only if it does not have a substring 
u such that there exists v € A* with u >- v and (u,vy accepted by D\. It is clear 
that, if £>1 really does have property Vi, then W will accept the set of irreducible 
words; in other words, it will be correct. 

The converse is not necessarily true; that is, it is possible for W to be correct 
and for D\ not to accept everything in L\. However, what is true is that if W is 
correct and u is reducible (so not accepted by W), then there is a substring s of u 
such that £>1 accepts (s,tY for some s y t, and it is not difficult to use £>1 to find s 
and t. (This technique is contained in the proof of Theorem 4.1 of [2], and we shall 
not give further details here.) Thus, if W is correct, then we can effectively use D\ 
to reduce words in A* to their irreducible representatives. That is, we can solve the 
word problem in G and calculate the maps a associated with the word-difference 
automata correctly. The subsequent steps of the algorithm use this technique under 
the assumption that W is indeed correct. As we shall see shortly, if this is the case, 
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and G really is short-lex automatic with respect to A , then it is guaranteed that 
after a finite number of iterations through the loop in Steps 3-5, the multipliers 
M x will also be correct, and so Step 6 will succeed. 

Another significant property of the W constructed is that, even if it is incorrect, 
it always accepts at least one word for each group element. To see this, let g G G 
and let u be a word in A mapping onto G. Then we can use D\ to reduce u to 
a word v accepted by W, and since we are assuming that D\ accepts the identity, 
we have u = v — g. This means that the language accepted by W always contains 
the irreducible words; that is, it can be too large but never too small. Since Step 
6 effectively checks that W accepts only irreducible words, we know that, if Step 
6 succeeds, then the automatic structure that we have constructed is for the given 
group G. In the more general situation described in Chapter 6 of [1], nothing about 
W is assumed, and so it is possible, even if the verification process succeeds, that 
the automatic structure is for a proper factor group of the given group G. 

The equality multiplier M$ constructed in Step 2 is not really necessary but it 
is useful to have it available in Step 6. 

In Step 3, our aim is to make D 2 equal to the automaton WD$ defined in Section 
2. For this, we need a candidate for the word-difference set WD. For our initial 
candidate (on the first call of Step 3), we let X be the set of all images d(r) for 
t G iS(-Di), and put 

S{D 2 ) =lur 1 udu{e}. 

We do this because we know that WD is closed under inversion (since A is), and 
we require it to contain A U {e}. We then define D 2 exactly as specified for WD$ 
in Section 2. That is, for r G S(D 2 ) and x,y G A\ A x,v ' > is defined if and only if 
x~ 1 ry G S(D 2 ), in which case it is equal to x~ 1 ry. The initial state and the only 
accept state is e. The expressions x~ x ry are calculated using the word reduction 
algorithm mentioned above. Remember that, if W is correct, then so is the word 
reduction algorithm, and so S(D 2 ) can be regarded as a subset of G and a = a. 
It follows that, if W is correct and S(D 2 ) really does contain WD, then D 2 , as 
constructed, will have property V 2 . However, in many examples S{D 2 ) will not 
contain WD on the first iteration, and Step 5 tests for this, and produces some 
explicit elements in WD \ S(D 2 ) if it is nonempty. If G is short-lex automatic 
with respect to A, then WD is finite, and so we will only need to go through this 
iteration a finite number of times (assuming of course that W is correct). 

We next describe Step 4. We construct the multipliers exactly according to the 
definition of standard automata in Definition 2.3.3 of [1]. For each M x , the set of 
states is tS(IT) x 5(IT) x S(D 2 ) with initial state (cto(IF), (7q{W), <Jq(D 2 )), and for 
x,y G and (< 71 , 02 , r) G S(M X ), the transition (oq, <t 2 , t)^’ 2 ^ is defined to be 
equal to (<jf, <r|, r^’ y - ) ) provided all three components are defined, and undefined 
otherwise. The M x for different x differ from each other only in their accept states; 
the accept states of M x are all (<7i,<t 2 ,t) such that <ti,o- 2 G A(W) and d(r) = x. 
However, once the M x have been constructed in this fashion, they are immediately 
minimized, after which they no longer have the same sets of states, and are no 
longer necessarily word-difference automata. 

It sometimes happens that, during the course of the construction in Step 4, we 
find distinct words u , v in A, both accepted by W, for which u = v in G. This 
means that W is accepting two words for the same group element, and so it cannot 
be correct. We can therefore return immediately to Step 1. (This happens when 
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there is (u,v) € L{D 2 ) \ L(Di), where (u,v) ought to be in L(D\), and so D\ is 
incorrect.) Although we know of no theoretical reason why this should be the case, 
we have never yet encountered an example in which we have got through to Step 6 
and the axiom checking has failed. It seems that, if our automata are wrong, then 
this fact becomes clear either at this stage or at Step 5. 

Note that, if W is correct (which ensures that S(D 2 ) is a subset of G and a 
is the identity map) and D 2 accepts everything in L 2 (defined in Section 2), then 
S(D 2 ) must contain WD. and so the multiplers M x will be correct. Suppose that 
W is correct but M x is not correct, for some x £ A. From the construction of M x , 
it is clear that, if M x accepts (it, up, then W accepts u and v, and ux = v in G. 
Thus, there exist words u and v in A* such that W accepts u and v and ux = v in 
G, but M x does not accept (u,vy. But then, since v is the unique word accepted 
by W that is equal to ux in G, M x cannot accept (it, w)' for any w £ A*. So we 
can test for this, by constructing the automata W A E(M X )’ for each x £ A, and 
we do this in Step 5. If all of these automata have empty accepted language and 
W is correct, then the M x must also be correct, and we proceed to Step 6 for the 
final verification. Otherwise, we find some explicit words u which are not accepted 
by some W A E(M X )’, and we let v be the reduction of ux (using Z?i); then W 
accepts v and ux = v in G. Since M x does not accept (it, v), there must be prefixes 
p(u) and p{v) of u and v of the same length, such that p(u)~ 1 p(v) £ S(D 2 ). We 
therefore return to Step 3, and adjoin all such elements and their inverses to S(D 2 ), 
and then recalculate D 2 itself. 

Step 6 depends on the following logical automaton operation. If Z\ and Z 2 are 
any two two-variable automata with the same input alphabet AI x Ay then we 
define their composite to be the two-variable automaton with language 

{(u,n)t|3ii; G A* : ( u,w )^ £ L(-Zi) and (w,v)^ G L(Z 2 )}. 

In particular, for multipliers M x and M y . we denote their composite by M xy . The 
composite operation is easily seen to be associative, and so we can define M u for any 
u G A* by repeated application, where M e = M$. It is also easy to see that, if M u 
accepts (v,wy, then vu = w in G. The axiom checking process essentially verifies 
the converse of this statement. More precisely, let 1Z be the set of defining relators 
of G, as a monoid (so it includes the inverse relators xx _1 , for all x G A). Then, 
it is proved in Theorem 2.5 of [2], that if M r has the same language as M$ for all 
r G 1Z. then the automata W and M x are correct. So we can check this by repeated 
application of the composite operation. Since this operation involves making a 
non-deterministic automaton deterministic, it potentially involves an exponential 
increase in the number of states. We try to keep this under control by minimizing 
after every operation. With a long relator r it seems to be most efficient to split 
r roughly in half as r = st and then form M s and M t , and construct M r as the 
composite of these. In most examples, Step 6 takes the longest time, and is often 
the most space-demanding. 

Finally we turn to Step 1. Our current approach to this is to use the Knuth- 
Bendix procedure on the group presentation to derive a list of reduction rules of 
the form ( 11 , 11 ), where u and v are words in A, u = v in G and u >- v. Since 
we devoted a large part of [2] to the Knuth-Bendix procedure, we will not discuss 
it in detail here, but concentrate on its application to Step 1 of the procedure. 
At any time, the existing rules are used to reduce words w in A to their least 



WARWICK AUTOMATIC GROUPS SOFTWARE 


9 


possible representative, by substituting v for u whenever u is a substring of w and 
(u, v) is a rule. Occasionally, the Knuth-Bendix procedure terminates with a finite 
complete set of reduction rules, in which case these rules suffice to reduce any 
word to its irreducible representative in G. But this does not usually happen. For 
any particular w, it is guaranteed that we will eventually generate enough rules to 
correctly reduce w, but we have no way in general of knowing how long this will 
take, or even whether we can currently reduce w correctly. (For example, the group 
itself could be trivial, but it could take a long time for this fact to emerge.) 

At regular intervals, we interrupt the Knuth-Bendix procedure, and form a can¬ 
didate for the word-difference automaton D\ using the current set of rules. We do 
this as follows. S(D i) is initially empty. Consider each rule (u,v) in turn, and let 
u = X 1 X 2 ■ ■ ■ x m and v = y 1 J /2 ■ • • y n with m > n. We pad v if necessary in the usual 
way, by putting y, = $ for n < i < m. Let g?o = £ and for 1 < i < m let di be the 
reduction, using the current rules, of x~ 1 di-\yi (so di is a word-difference). Then 
we adjoin the states d, to S(D\) and the transitions d[^ Vt ^ = di to Di if any of 
them are not there already. Of course a maps a state to its image in G, and the 
start and accept states are both e. 

Suppose G really is short-lex automatic with respect to A. Then, as we saw 
in Section 2, there is a word-difference automaton that accepts the language L\. 
Since this automaton contains only finitely many states and transitions, it follows 
that these states and transitions will be manifested by a finite subset of L\. Thus 
once we have run the Knuth-Bendix procedure long enough to generate all of the 
rules in this finite subset, and we have enough rules to correctly reduce each of the 
transitions x~ 1 di-\yi described above, the candidate for D\ that we construct will 
have the required property Vi, and then the remaining steps 2-6 of the complete 
procedure will also complete successfully. 

The big problem is to recognize when this has happened and to proceed to Step 

2. Currently, we have no completely satisfactory heuristics for this. Our default 
method is to wait until several successive candidates for D\ have been identical; 
that is to wait until D\ has apparently become constant. This approach is successful 
for many straightforward examples. However, because the current system of rules 
at any time is not adequate to reduce all strings correctly, it happens sometimes 
that the D i constructed has superfluous states and accepts strings that are not 
in L\. This, in itself, does not matter provided that there are not too many such 
superfluous states, but it has the effect that it makes it difficult to recognize when 
D i has property Vi, because it never appears to become stable. In such cases, 
the user has to decide interactively when to stop. Of course, if Di appears to be 
increasing steadily and rapidly in size, then this is good evidence that G is not 
short-lex automatic with respect to A. 

Currently, if we need to return to Step 1 after Steps 3 or 6, then we begin 
again completely, because it seems to difficult (and space consuming) to attempt 
to restart from where we left off last time. As we mentioned above, we have never 
encountered an example in which Step 6 failed. It would probably be a good idea to 
insert the discovered identity u = v as a new rule if we return to Step 1 after Step 

3, but we are not currently doing that. Alternatively, we could simply recalculate 
D i after adjoining the new rule (u, v) and not use Knuth-Bendix again at all. We 
intend to experiment with such alternative strategies in the near future. Another 
simple policy, which we do use sometimes, is to use Di in place of Di, which seems 
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reasonable theoretically, since the language of D 2 is usually larger than that of D\. 
In fact, this is often effective; the only disadvantage is that it usually increases the 
space requirements of Step 2 quite considerably, and so it is not always practical. 

As we have seen, Steps 2-6 are more routine and predictable than Step 1, but 
they are subject to exponential increase in storage demands at certain points. Our 
experience with examples indicates that when D\ is correct (i.e. it has property V\, 
then the space requirements will be large, but not impracticably so, but when D\ is 
incorrect, the exponential increases will really occur and cripple the whole process. 
There is no known theoretical reason why this should be the case, but it is seems 
reasonable to suppose that when D\ is correct then the subsequent calculations are 
constructing natural objects associated with G, whereas when D\ is incorrect, the 
calculations are in some sense meaningless and therefore random. 

This situation indicates that the most important problem now is to try to find 
improved methods of carrying out Step 1. Improvements to the remaining steps 
would also be useful of course, but they are more likely to be purely computational 
in nature rather than algorithmic. In the Knuth-Bendix procedure, there is consid¬ 
erable freedom in the choice order in which we process overlaps. A strategy needs 
to be found which produces the finite subset of L\ mentioned above as quickly as 
possible. There may also be completely different approaches to Step 1, which do 
not involve Knuth-Bendix, but none has been found to date. 

4. Examples 

A number of relatively easy and straightforward examples in which the Au¬ 
tomata program runs successfully were described in [2]. Here we describe some 
examples of varying difficulty which have arisen in some natural context during the 
past few years, for which we have successfully computed the automatic structure. 
There are two shell-scripts available which run the complete procedure described 
in Section 3, by running the individual programs that perform the steps in the 
procedure in the correct order. These are called automata and automatac. The 
second of these has often proved more efficacious on the harder examples. 

The first example is a group that was used by Charlie Gunn at the Geometry 
Center, Minneapolis, in connection with drawing pictures of tessellations of hyper¬ 
bolic space with a dodecahedral fundamental region. Such pictures were used in 
the film "Not Knot” produced by the Center. The word-acceptor of the group is 
used to generate each group-element exactly once, which results in faster and more 
accurate drawings. The presentation is 

Gi = (a, b , c, d, e , /|a 4 , b 4 , c 4 , d 4 , e 4 , / 4 , abAe, bcBf , cdCa , deDb, efEc, faFd). 

(In all of the examples in this section, we use the inverse by case-change convention; 
that is A is the inverse of a , etc.) The automatac program completed in a total of 
545 seconds on a Silicon Graphics Iris XS24 machine, with 32 Megabytes of core 
memory available. The word-acceptor has 48 states and the multipliers range from 
145-175 states. This example went twice round the first loop (Steps 1-4) of the 
algorithm, and six times round the second loop (Steps 3-6). 

In the above example, and in many of those that follow, it would be possible to 
eliminate several of the generators before starting. Experience has shown that this 
is not a good idea in general. The programs appear to run best on the natural pre¬ 
sentations of the groups, rather than on presentations that have been manipulated 
by the user. 
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The second example is a group arising from a (2,2, 2, 2, 2, 2)-orbifold proposed 
by Bill Thurston. The presentation is 

G 2 = (a, b, c, d, e,f,g, h, i, j, k, l\ 

dcbadc, ( ahlE ) 2 , ( beJF ) 2 , ( cfKG ) 2 , ( dgLH ) 2 , (ilKjf). 

Initially this caused problems because of the large number of generators, which 
meant that the alphabet for the two-variable automata had size 361. To cope 
with this, we had to provide an optional new method of storage for automata, 
essentially storing the transition table as a sparse matrix, rather than as a simple 
two-dimensional array. The example then became tractable. The word-acceptor 
has 131 states and the multipliers range from 132-232 states. The total time on 
the SGI machine was 930 seconds. 

The Fibonacci groups have been favorite test examples in combinatorial group 
theory for many years. The groups F(2,n) (for n > 2, say) have the presentation 

F(2,n) = (ai,... ,a n \aia 2 = a 3 , a 2 a 3 = a 4 ,... , a„_ia„ = ai, a n ai = a 2 ). 

The group is finite when n = 3,4, 5 or 7. For n = 6 it is Euclidean (free abelian-by 
finite), and is an easy example for automata. For n > 8 and n even, they are 
known to be hyperbolic, and hence short-lex automatic. For n = 8, the automata 
program ran out of space on the SGI machine while computing the multipliers. It 
completed (in about 9 hours cpu-time) on a much slower Solbourne machine with 
128 Megabytes of memory. The word-acceptor has 212 states and the multipliers 
all have 1861 states. In this example, the automatac program went twice round 
the first loop, then twice round the second loop, and then needed to go another 
time around the first loop before completing. In other words D\ was wrong the 
first time (with 51 word-differences), then Z? 2 was wrong, and then it transpired 
that D\ was wrong again (with 54 word-differences). There are in fact 58 states 
(word-differences) in the correct D\ automaton. 

We have also successfully computed the automatic structure of F( 2,10), which 
is larger (and more difficult) than that of F( 2,8), but not disproportionately so. 
The examples for n odd are much more difficult, and it is not known whether or 
not they are hyperbolic. Since writing the first draft of this paper, the author has 
rewritten some of the procedures in Automata, using improved data structures 
(but no essentially new algorithms), and it has become possible to complete some 
larger examples. In particular, F{ 2,9) has been proved to be short-lex automatic 
using its natural presentation, and this is the largest automatic structure that we 
have successfully computed to date. The word-acceptor has 3251 states, and the 
multipliers about 25000 states, but they had about 860000 states before minimiza¬ 
tion. The complete computation took about 12 hours of cpu-time and used up to 
about 130 Megabytes of memory on a new SPARCstation 20. More details of this 
calculation can be found in [3]. This group had already been proved to be infinite 
by M. F. Newman in [4]; the automatic structure easily provides an independent 
proof of this fact, together with additional information, such as the fact that the 
generators have infinite order, which had not been known previously. 

We conclude by mentioning some further large examples which we have success¬ 
fully computed recently. The first of these is a presentation for the Picard group 
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P = SL( 2, R), where R is the ring of Gaussian integers, taken from [6]: 

P = (a,t,u,l,j | 

a 2 = l 2 = (at) 3 = ( al) 2 = (tl) 2 = ( ul ) 2 = ( ual ) 3 = j,j 2 = [t,u] = 1, j central). 

The word-acceptor has 403 states and the multipliers range from 403-3718. (The 
smallest is for the central generator j, which is of course —/ in SL(2,R).) The 
most remarkable thing about this example is that it was necessary to go 12 times 
around the second loop in the procedure. This suggests that techniques should be 
sought to find more missing word-differences at the same time. 

The following presentation was proposed by Andre Rocha: 

G = (a, b, c, d, e, f\BceFa, CdfBa , DebCa, EfcDa, FbdEa , ecfdb). 

(This is the fundamental group of a negatively closed Seifert-Weber manifold.) The 
word-acceptor has 1429 states, and the multipliers about 14000 states. 

The final presentation was proposed several years ago by Thurston, and repre¬ 
sents a knot with surgery. (The first two relations define the group of the knot, and 
the final relation the surgery. The computation for the knot group itself, obtained 
by omitting the final relation, is relatively straightforward.) 

G = (x,y,t\txT = xyx,tyT = xy, ( XYxy) 2 = t 3 ). 

The word-acceptor has 1188 states, and the multipliers about 9300 states. 

5. Related Algorithms 

In this final section, we mention some related algorithms that have been imple¬ 
mented at Warwick, which take an automatic structure of a group as input. The 
first few of these are fairly easy, and the implementations are distributed with Au¬ 
tomata. There is a program to enumerate the accepted words of a finite state 
automaton up to a given length, using a depth-first search. When applied to the 
word-acceptor of a slrort-lex automatic group, this produces a unique word for each 
group element, and is the basis of many of the applications of the programs to 
hyperbolic geometry, particularly to drawing pictures. There is a fast and simple 
program to decide whether the language of an automaton is finite and, if so, to 
determine its order. This can be used on the word-acceptor to determine the order 
of a finite group. (For most presentations of finite groups, Todd-Coxeter coset enu¬ 
meration provides the fastest mechanical way of determining the order, but there 
are some cases in which the automatic group method works faster.) 

If a group is word-hyperbolic, then the geodesics between any two vertices in 
the Cayley graph have the fellow-traveler property - that is they remain within a 
bounded distance of each other. It has recently been proved by Panos Papasoglu 
(see [5]) that the converse of this statement is also true; if the geodesics in the 
Cayley graph have the fellow-traveler property, then the group is hyperbolic. This 
means that there should be an automatic structure for the group in which the word- 
acceptor accepts all shortest words for each group element. There is a program 
available with the package which attempts to calculate this word-acceptor. If it 
terminates, then the geodesic word-acceptor that is calculated is guaranteed to 
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be correct and so, by the result of Papasoglu, we have verified that the group is 
hyperbolic. 

The algorithm used is as follows. We assume that some short-lex automatic 
structure for the group G has already been calculated. Assume also that we have a 
list WVQ of some or all of the word-differences that occur between geodesics in the 
Cayley graph of G , and that WVQ contains the set WV (see Section 3) that was 
calculated with the short-lex automatic structure. Let WDG be the corresponding 
two-variable word-difference automaton which accepts (u, v) if and only if u = v, 
u and v have equal lengths, and the word-differences arising from the two paths u 
and v all lie in WVQ. Now calculate a sequence Wi(i > 0) of finite state automata, 
where Wq is the short-lex word-acceptor, which we already have, and 

Wi+i = {w|3i> € L(Wi) with (u,v) € L(WDG)}. 

Then L(Wi) C L(Wi + 1 ) for all i, and if L(Wi) = L(Wi + 1 ) for some i, then L(Wi) 
is the set of all geodesic words, and so W % is the required geodesic word-acceptor. 
(To see this, let u be any geodesic word, let v be the short-lex reduced word equal 
to u in G, and let u = «o, ui,... , u n = v be a reduction of u to v where, for each i, 
( Ui , Uj+i) is in the language L 2 defined in Section 2. Since u is a geodesic, the 14 all 
have the same length and, since we are assuming that WVQ contains WV, we have 
(v,i,Ui+ 1 ) € L(WDG) for all i, and so u G L(W n ).) So if the algorithm terminates, 
then we know that the answer is correct. Conversely, if WVQ contains all the word- 
differences that arise from pairs (u,v), where u is a geodesic and v is the short-lex 
reduced word equal to u in G, then the algorithm will terminate; in fact W\ will 
already be the correct geodesic word-acceptor. The only problem with all of this 
is how to find the set WVQ. We can try WVQ = WV as a first attempt, but that 
is often not large enough. Currently, our method is to choose random geodesics u 
up to a specified length (40 or 50 is usually adequate in the examples that we are 
considering), reduce them to their short-lex minimal equivalents v, calculate the 
word-differences that arise from (u,v), and adjoin them to WVQ if they are not 
there already. A deterministic method for doing this has been proposed by David 
Epstein but has not yet been implemented. 

For example, in the group G\ of Section 4, \WV\ = 75 and the algorithm does 
not appear to terminate with WVQ = WV, but by choosing random geodesics 
we rapidly increase WVQ to size 103, and the procedure then terminates with the 
geodesic word-acceptor W\ having 64 states. In G 2 , WV has size 99 and is already 
adequate to calculate the geodesic word-acceptor, which has 156 states. We have 
not yet succeeded in calculating the geodesic word-acceptor for F( 2, 8 ). 

Finally, we should like to mention the growth-rate program, written by Uri Zwick. 
This is not supplied with Automata, but a binary executable for Sun 4 architecture 
is available on request. The growth-function for a group G with a given monoid 
generating set A is defined to be the formal power series /g,a(1) = 0 W (*)^> 

where w(i) is the number of elements of G for which the shortest word in A has 
length i. If we have a short-lex word-acceptor W for G with respect to A, then 
clearly w(i) is equal to the number of words of length i in L(W), and so /g,a( 1) 
can be calculated from W. It can be proved that the generating function of the 
language of a finite state automaton is always a rational function of t , and the result 
is output in this form by Zwick’s program. For example, the growth function for 
the free group on two generators is (1 + t)/(l — 3 1), whereas the growth-rate for the 
example Gi in Section 4 is (1 + 3f + 3t 2 + t 3 )/(l — 9t + 9t 2 — t 3 ). 
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